SEQUENCE LISTING



(1) GENERAL INFORMATION:



(i) APPLICANT: 




Lebrun, Michel
Sailland, Alain
Freyssinet, Georges
DeGryse, Eric



(ii) TITLE OF INVENTION: Mutated 5-enolpyruvylshikimate-3-phosphate synthase,
gene coding for this protein and transformed plants containing this gene

(iii) NUMBER OF SEQUENCES: 5

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Connolly, Bove, Lodge and Hutz 

(B) STREET: 1220 Market Street 
(C) CITY: Wilmington

(D) STATE: Delaware 

(E) COUNTRY: U.S.A.

(F) ZIP: 19899 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: 

(B) COMPUTER: 

(C) OPERATING SYSTEM: 

(D) SOFTWARE: 

(vi) CURRENT APPLICATION DATA:

(A) APPLICATION NUMBER:

(B) FILING DATE:
(C) CLASSIFICATION:



(vii) PRIOR APPLICATION DATA:

(A) APPLICATION NUMBER: PCT/FR96/01125

(B) FILING DATE: 18-JUL-1996 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Robert G. McMorrow, Jr. 

(ix) TELECOMMUNICATION INFORMATION:

(A) TELEPHONE: (302) 658-9141 

(B) TELEFAX: (302) 658-5614 

(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1713 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE:

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Zea mays

(B) STRAIN: Black Mexican Sweet

(F) TISSUE TYPE: Callus

(vii) IMMEDIATE SOURCE:

(A) LIBRARY: lambda gt10
(B) CLONE: pRPA-ML-711



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

AATCAATTTC ACACMOAAA GMCTATttC CKTOATTA09 AArTOSOGOC a»0C9O9T9 €0 

ATOQ99090C WCAGOMCO GOOOOOmc WOWTWC OOMKMMC 120 

TGAAOGA GA7CT0093C AOODTGAAOC TOOO9O90TC CMOTCBCTT TOOUS O OW ISO 

TCCTOCTACT 0000GOCCT0 TOOmOW CAAOWJT09T TOATAMCCM CTQAAGMTTt *40 

AfiOATSTCCA CtACATGCTC OOOOOfftta* OOAatTCTTO TCTCTCTOTC 0AA9OCHACA 300 

AA3CTOOCAA AAQAOCTOTA (ITT Uf T OOC T OTOmOAAA < ll T OCC *m 0A9GAT0CTA 3*0 

AAGAfiGAMT OCAOCTCTTC TTOOOOAAM C*OdAACT«C AATQO0O0CA TTOACMCM 420 

CWTTACTOC TOCT0970GA AATOGMCT? ACfrTOCTTOA TCttAJTAOOk AOAMGAMO 410 
MirtMGQA-r timuam rrrmrewAT tga*;cai;ct nrnxmct mwnurr via

tccttwcac Trw^r-oa* ccrrTrrcrrrn tcaatooaat cnwrocrA cctogtgcca .100 

a*;tovwct rrrerr.-wrw atcaucwttc ArrrAffrrciAG Tivmm ATwerccrc wo 

•Trrwrcrr Tcrarrrrr; gacjattoaaa tcatmataa attaatctcc attccgtacg 7 20 

TOtfAAATCAC ATTTOOATTG ATGGACOTTT TTOOTOTGAA AOGAGAGGA'f TCTCA7ACCT 7 d0 

WACArtATT CTACATTAAC GGAGGTCAAA AATACAWC OCCTAAAAAT CSCCTATOTTO $40 

AAGGTGATGC CTCAAGCGCA AGCTATTTCT TOSCKWTOC TGGAATTACT CGAOGGACTG * 0 0 

TOAOTOTOGA AooTTdTGoc ACCACGWJT? 7GCAWMA TGTOAAGTTT GCTGAGGTAC MO 

TGGAflWGAT GOGACCGAAO GTTACATGGA GQ3AAACTAG COTAACTOTT ACTOOCCCftC 1020 

C0C5GGW0C ATTTGGGAGG AAAGAOCTGA AflOCOATTGA WICMCW AAGAAGATGC lOtO 

CTGATCTCGC CATGACTCTT OCTGTOGTTG COCTCTTTOC CGAffGOCCCG ACWOCATCA U40 

CMACGTGOC TTCCTOGAfiA GTAAAGGAjQA CC3AGAGOAT GGTTOCCATC GOOAOBOMIC 1200 

TAAGGAAflC? GGOAOGATCT 0T7GAG0AAG GOOOOOftCTA CIGCKTOttC ACBQOGOCQG 1260 

AfiAAOTTOAA CGTGAcmcG ATWSACWT A0OACOAOGA GMGATGQCC ATGGCCTTCT 1320 

CXSTTTGODBC CTOTOCCGAO GTCCOOTCA CCWOaWOA OOCTGGCmC AOCCGGAASA 1380 

OCTTOCOOGA CTACTTOSAT G7GCTGM1GA CTTTOGTGAA OAATTAATAA A90GTOOGA7 1440 

ACTACCACBC AflCTtOATTG AAGTGATAfiG CTTGTGCTOA GOAAAtACAT Hill HUH 1500 

CTGTTTTTCT CTTTCW»GG ATTAAOrrTT GAGTCTGTAA GOTTAOmT mTAGCAAO 1560 

TTTCTATTTC GGATCTTAAfi TTTCTCCACT OTAAGCCAAA TTTCATTTCA AGAffTGdTTC 1«20 

OTTGGAATAA TAAdAATAAT AAATTAOT7 TGA0TOAAAA AAAAAAAAAA AAAAAAAAAA 1C80 

AAAAAAAAAA AAAAAAAAAA AACOCGGGAA T7C 1?13 
(2) INFORMATION FOR SEQ ID NO:2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1340 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Zea mays

(B) STRAIN: Black Mexican Sweet

(vii) IMMEDIATE SOURCE:

(B) CLONE: pRPA-ML-716

(ix) FEATURE:

(A) NAME/KEY: CDS
(B) LOCATION: 6..1337

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:

CCATG GCC GGC GCC GAG GAG ATC GTG CTG CAG CCC ATC AAG GAG ATC 47
      Ala Gly Ala Glu Glu Ile Val Leu Gln Pro Ile Lys Glu Ile
      1               5                   10

TCC GGC ACC GTC AAG CTG CCG GGG TCC AAG TCG CTT TCC AAC CGG ATC 95
Ser Gly Thr Val Lys Leu Pro Gly Ser Lys Ser Leu Ser Asn Arg Ile
15                  20                  25                  30

CTC CTA CTC GCC GCC CTG TCC GAG GGG ACA ACA GTG GTT GAT AAC CTG 143
Leu Leu Leu Ala Ala Leu Ser Glu Gly Thr Thr Val Val Asp Asn Leu
                35                  40                  45

CTG AAC AGT GAG GAT GTC CAC TAC ATG CTC GGG GCC TTG AGG ACT CTT 191
Leu Asn Ser Glu Asp Val His Tyr Met Leu Gly Ala Leu Arg Thr Leu
            50                  55                  60



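GGT CTC TCT GTC GAA GCG GAC AAA GCT GCC AAA AGA GCT GTA GTT GTT 239
Gly Leu Ser Val Glu Ala Asp Lys Ala Ala Lys Arg Ala Val Val Val
        65                  70                  75

GGC TGT GGT GGA AAG TTC CCA GTT GAG GAT GCT AAA GAG GAA GTG CAG 287
Gly Cys Gly Gly Lys Phe Pro Val Glu Asp Ala Lys Glu Glu Val Gln
    80                  85                  90

CTC TTC TTG GGG AAT GCT GGA ACT GCA ATG CGG CCA TTG ACA GCA GCT 335
Leu Phe Leu Gly Asn Ala Gly Thr Ala Met Arg Pro Leu Thr Ala Ala
95                  100                 105                 110

GTT ACT GCT GCT GGT GGA AAT GCA ACT TAC GTG CTT GAT GGA GTA CCA 383
Val Thr Ala Ala Gly Gly Asn Ala Thr Tyr Val Leu Asp Gly Val Pro
                115                 120                 125

AGA ATG AGG GAG AGA CCC ATT GGC GAC TTG GTT GTC GGA TTG AAG CAG 431
Arg Met Arg Glu Arg Pro Ile Gly Asp Leu Val Val Gly Leu Lys Gln
            130                 135                 140

CTT GGT GCA GAT GTT GAT TGT TTC CTT GGC ACT GAC TGC CCA CCT GTT 479
Leu Gly Ala Asp Val Asp Cys Phe Leu Gly Thr Asp Cys Pro Pro Val
        145                 150                 155

CGT GTC AAT GGA ATC GGA GGG CTA CCT GGT GGC AAG GTC AAG CTG TCT 527
Arg Val Asn Gly Ile Gly Gly Leu Pro Gly Gly Lys Val Lys Leu Ser
    160                 165                 170

GGC TCC ATC AGC AGT CAG TAC TTG AGT GCC TTG CTG ATG GCT GCT CCT 575
Gly Ser Ile Ser Ser Gln Tyr Leu Ser Ala Leu Leu Met Ala Ala Pro
175                 180                 185                 190

TTG GCT CTT GGG GAT GTG GAG ATT GAA ATC ATT GAT AAA TTA ATC TCC 623
Leu Ala Leu Gly Asp Val Glu Ile Glu Ile Ile Asp Lys Leu Ile Ser
                195                 200                 205

ATT CCG TAC GTC GAA ATG ACA TTG AGA TTG ATG GAG CGT TTT GGT GTG 671
Ile Pro Tyr Val Glu Met Thr Leu Arg Leu Met Glu Arg Phe Gly Val
            210                 215                 220

AAA GCA GAG CAT TCT GAT AGC TGG GAC AGA TTC TAC ATT AAG GGA GGT 719
Lys Ala Glu His Ser Asp Ser Trp Asp Arg Phe Tyr Ile Lys Gly Gly
        225                 230                 235

CAA AAA TAC AAG TCC CCT AAA AAT GCC TAT GTT GAA GGT GAT GCC TCA 767
Gln Lys Tyr Lys Ser Pro Lys Asn Ala Tyr Val Glu Gly Asp Ala Ser
    240                 245                 250

AGC GCA AGC TAT TTC TTG GCT GGT GCT GCA ATT ACT GGA GGG ACT GTG 815
Ser Ala Ser Tyr Phe Leu Ala Gly Ala Ala Ile Thr Gly Gly Thr Val
255                 260                 265                 270

ACT GTG GAA GGT TGT GGC ACC ACC AGT TTG CAG GGT GAT GTG AAG TTT 863
Thr Val Glu Gly Cys Gly Thr Thr Ser Leu Gln Gly Asp Val Lys Phe
                275                 280                 285

GCT GAG GTA CTG GAG ATG ATG GGA GCG AAG GTT ACA TGG ACC GAG ACT 911
Ala Glu Val Leu Glu Met Met Gly Ala Lys Val Thr Trp Thr Glu Thr
            290                 295                 300

AGC GTA ACT GTT ACT GGC CCA CCG CGG GAG CCA TTT GGG AGG AAA CAC 959
Ser Val Thr Val Thr Gly Pro Pro Arg Glu Pro Phe Gly Arg Lys His
        305                 310                 315

CTC AAG GCG ATT GAT GTC AAC ATG AAC AAG ATG CCT GAT GTC GCC ATG 1007
Leu Lys Ala Ile Asp Val Asn Met Asn Lys Met Pro Asp Val Ala Met
    320                 325                 330

ACT CTT GCT GTG GTT GCC CTC TTT GCC GAT GGC CCG ACA GCC ATC AGA 1055
Thr Leu Ala Val Val Ala Leu Phe Ala Asp Gly Pro Thr Ala Ile Arg
335                 340                 345                 350

GAC GTG GCT TCC TGG AGA GTA AAG GAG ACC GAG AGG ATG GTT GCG ATC 1103
Asp Val Ala Ser Trp Arg Val Lys Glu Thr Glu Arg Met Val Ala Ile
                355                 360                 365

CGG ACG GAG CTA ACC AAG CTG GGA GCA TCT GTT GAG GAA GGG CCG GAC 1151
Arg Thr Glu Leu Thr Lys Leu Gly Ala Ser Val Glu Glu Gly Pro Asp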
            370                 375                 380

TAC TGC ATC ATC ACG CCG CCG GAG AAG CTG AAC GTG ACG GCG ATC GAC 1199
Tyr Cys Ile Ile Thr Pro Pro Glu Lys Leu Asn Val Thr Ala Ile Asp
        385                 390                 395

ACG TAC GAC GAC CAC AGG ATG GCC ATG GCC TTC TCC CTT GCC GCC TGT 1247
Thr Tyr Asp Asp His Arg Met Ala Met Ala Phe Ser Leu Ala Ala Cys
    400                 405                 410

GCC GAG GTC CCC GTC ACC ATC CGG GAC CCT GGG TGC ACC CGG AAG ACC 1295
Ala Glu Val Pro Val Thr Ile Arg Asp Pro Gly Cys Thr Arg Lys Thr
415                 420                 425                 430

TTC CCC GAC TAC TTC GAT GTG CTG AGC ACT TTC GTC AAG AAT 1337
Phe Pro Asp Tyr Phe Asp Val Leu Ser Thr Phe Val Lys Asn
                435                 440

TAA 1340
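
As an illustrative check that is not part of the original listing, the short Python sketch below translates the first row of the coding sequence of SEQ ID NO:2 with the standard genetic code and compares it with the three-letter translation printed under that row. The codon string is the first row as read from this listing (positions 6 to 47), the use of the standard genetic code is an assumption, and the names CODON_TABLE and translate are hypothetical helpers introduced only for this example.

```python
from itertools import product

# Standard genetic code laid out in TCAG order. Assumption: the listing does
# not name a translation table, so the standard table is used here.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(codons: str) -> str:
    """Translate a whitespace-separated codon string into one-letter residues."""
    return "".join(CODON_TABLE[codon] for codon in codons.upper().split())


# First row of the CDS of SEQ ID NO:2, as read from the listing.
FIRST_ROW = "GCC GGC GCC GAG GAG ATC GTG CTG CAG CCC ATC AAG GAG ATC"

print(translate(FIRST_ROW))  # AGAEEIVLQPIKEI
```

The printed string corresponds to Ala Gly Ala Glu Glu Ile Val Leu Gln Pro Ile Lys Glu Ile, the first fourteen residues given for SEQ ID NO:2.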



(2) INFORMATION FOR SEQ ID NO:3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 444 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:

Ala Gly Ala Glu Glu Ile Val Leu Gln Pro Ile Lys Glu Ile Ser Gly
1               5                   10                  15

Thr Val Lys Leu Pro Gly Ser Lys Ser Leu Ser Asn Arg Ile Leu Leu
            20                  25                  30

Leu Ala Ala Leu Ser Glu Gly Thr Thr Val Val Asp Asn Leu Leu Asn
        35                  40                  45

Ser Glu Asp Val His Tyr Met Leu Gly Ala Leu Arg Thr Leu Gly Leu
    50                  55                  60

Ser Val Glu Ala Asp Lys Ala Ala Lys Arg Ala Val Val Val Gly Cys
65                  70                  75                  80

Gly Gly Lys Phe Pro Val Glu Asp Ala Lys Glu Glu Val Gln Leu Phe
                85                  90                  95

Leu Gly Asn Ala Gly Thr Ala Met Arg Pro Leu Thr Ala Ala Val Thr
            100                 105                 110

Ala Ala Gly Gly Asn Ala Thr Tyr Val Leu Asp Gly Val Pro Arg Met
        115                 120                 125

Arg Glu Arg Pro Ile Gly Asp Leu Val Val Gly Leu Lys Gln Leu Gly
    130                 135                 140

Ala Asp Val Asp Cys Phe Leu Gly Thr Asp Cys Pro Pro Val Arg Val
145                 150                 155                 160

Asn Gly Ile Gly Gly Leu Pro Gly Gly Lys Val Lys Leu Ser Gly Ser
                165                 170                 175

Ile Ser Ser Gln Tyr Leu Ser Ala Leu Leu Met Ala Ala Pro Leu Ala
            180                 185                 190

Leu Gly Asp Val Glu Ile Glu Ile Ile Asp Lys Leu Ile Ser Ile Pro
        195                 200                 205

Tyr Val Glu Met Thr Leu Arg Leu Met Glu Arg Phe Gly Val Lys Ala
    210                 215                 220

Glu His Ser Asp Ser Trp Asp Arg Phe Tyr Ile Lys Gly Gly Gln Lys
225                 230                 235                 240

Tyr Lys Ser Pro Lys Asn Ala Tyr Val Glu Gly Asp Ala Ser Ser Ala
                245                 250                 255



Ser Tyr Phe Leu Ala Gly Ala Ala Ile Thr Gly Gly Thr Val Thr Val
            260                 265                 270

Glu Gly Cys Gly Thr Thr Ser Leu Gln Gly Asp Val Lys Phe Ala Glu
        275                 280                 285

Val Leu Glu Met Met Gly Ala Lys Val Thr Trp Thr Glu Thr Ser Val
    290                 295                 300

Thr Val Thr Gly Pro Pro Arg Glu Pro Phe Gly Arg Lys His Leu Lys
305                 310                 315                 320

Ala Ile Asp Val Asn Met Asn Lys Met Pro Asp Val Ala Met Thr Leu
                325                 330                 335

Ala Val Val Ala Leu Phe Ala Asp Gly Pro Thr Ala Ile Arg Asp Val
            340                 345                 350

Ala Ser Trp Arg Val Lys Glu Thr Glu Arg Met Val Ala Ile Arg Thr
        355                 360                 365

Glu Leu Thr Lys Leu Gly Ala Ser Val Glu Glu Gly Pro Asp Tyr Cys
    370                 375                 380

Ile Ile Thr Pro Pro Glu Lys Leu Asn Val Thr Ala Ile Asp Thr Tyr
385                 390                 395                 400

Asp Asp His Arg Met Ala Met Ala Phe Ser Leu Ala Ala Cys Ala Glu
                405                 410                 415

Val Pro Val Thr Ile Arg Asp Pro Gly Cys Thr Arg Lys Thr Phe Pro
            420                 425                 430

Asp Tyr Phe Asp Val Leu Ser Thr Phe Val Lys Asn
        435                 440
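
The lengths stated in this listing can be cross-checked with simple arithmetic. The Python sketch below is an illustration only and not part of the original listing: it takes a CDS range of 6..1337 within a 1340 base pair sequence and confirms that it spans 444 codons, matching the 444 amino acids of SEQ ID NO:3, with three bases left for the stop codon. The start position 6 is an assumption inferred from the row-end position 47 after the first fourteen codons, and cds_summary is a hypothetical helper name.

```python
def cds_summary(start: int, end: int, total_length: int) -> dict:
    """Summarise a 1-based, inclusive CDS range within a sequence."""
    coding_bases = end - start + 1
    if coding_bases % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return {
        "coding_bases": coding_bases,
        "codons": coding_bases // 3,
        # Bases remaining after the CDS (here, the stop codon).
        "bases_after_cds": total_length - end,
    }


summary = cds_summary(start=6, end=1337, total_length=1340)
print(summary)
# {'coding_bases': 1332, 'codons': 444, 'bases_after_cds': 3}
```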

(2) INFORMATION FOR SEQ ID NO:4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1340 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Zea mays

(B) STRAIN: Black Mexican Sweet

(vii) IMMEDIATE SOURCE:

(B) CLONE: pRPA-ML-720

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 6..1337

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:

CCATG GCC GGC GCC GAG GAG ATC GTG CTG CAG CCC ATC AAG GAG ATC 47
      Ala Gly Ala Glu Glu Ile Val Leu Gln Pro Ile Lys Glu Ile
      1               5                   10

TCC GGC ACC GTC AAG CTG CCG GGG TCC AAG TCG CTT TCC AAC CGG ATC 95
Ser Gly Thr Val Lys Leu Pro Gly Ser Lys Ser Leu Ser Asn Arg Ile
15                  20                  25                  30

CTC CTA CTC GCC GCC CTG TCC GAG GGG ACA ACA GTG GTT GAT AAC CTG 143
Leu Leu Leu Ala Ala Leu Ser Glu Gly Thr Thr Val Val Asp Asn Leu
                35                  40                  45

CTG AAC AGT GAG GAT GTC CAC TAC ATG CTC GGG GCC TTG AGG ACT CTT 191
Leu Asn Ser Glu Asp Val His Tyr Met Leu Gly Ala Leu Arg Thr Leu
            50                  55                  60

GGT CTC TCT GTC GAA GCG GAC AAA GCT GCC AAA AGA GCT GTA GTT GTT 239
Gly Leu Ser Val Glu Ala Asp Lys Ala Ala Lys Arg Ala Val Val Val
        65                  70                  75






GGC TGT GGT GGA AAG TTC CCA GTT GAG GAT GCT AAA GAG GAA GTG CAG 287
Gly Cys Gly Gly Lys Phe Pro Val Glu Asp Ala Lys Glu Glu Val Gln
    80                  85                  90

CTC TTC TTG GGG AAT GCT GGA ATC GCA ATG CGG [illegible] TTG ACA GCA GCT 335
Leu Phe Leu Gly Asn Ala Gly Ile Ala Met Arg Ser         Leu Thr Ala Ala
95                  100                 105                         110

GTT ACT GCT GCT GGT GGA AAT GCA ACT TAC GTG CTT GAT GGA GTA CCA 383
Val Thr Ala Ala Gly Gly Asn Ala Thr Tyr Val Leu Asp Gly Val Pro
                115                 120                 125

AGA ATG AGG GAG AGA CCC ATT GGC GAC TTG GTT GTC GGA TTG AAG CAG 431
Arg Met Arg Glu Arg Pro Ile Gly Asp Leu Val Val Gly Leu Lys Gln
            130                 135                 140

CTT GGT GCA GAT GTT GAT TGT TTC CTT GGC ACT GAC TGC CCA CCT GTT 479
Leu Gly Ala Asp Val Asp Cys Phe Leu Gly Thr Asp Cys Pro Pro Val
        145                 150                 155

CGT GTC AAT GGA ATC GGA GGG CTA CCT GGT GGC AAG GTC AAG CTG TCT 527
Arg Val Asn Gly Ile Gly Gly Leu Pro Gly Gly Lys Val Lys Leu Ser
    160                 165                 170

GGC TCC ATC AGC AGT CAG TAC TTG AGT GCC TTG CTG ATG GCT GCT CCT 575
Gly Ser Ile Ser Ser Gln Tyr Leu Ser Ala Leu Leu Met Ala Ala Pro
175                 180                 185                 190

TTG GCT CTT GGG GAT GTG GAG ATT GAA ATC ATT GAT AAA TTA ATC TCC 623
Leu Ala Leu Gly Asp Val Glu Ile Glu Ile Ile Asp Lys Leu Ile Ser
                195                 200                 205

ATT CCG TAC GTC GAA ATG ACA TTG AGA TTG ATG GAG CGT TTT GGT GTG 671
Ile Pro Tyr Val Glu Met Thr Leu Arg Leu Met Glu Arg Phe Gly Val
            210                 215                 220

AAA GCA GAG CAT TCT GAT AGC TGG GAC AGA TTC TAC ATT AAG GGA GGT 719
Lys Ala Glu His Ser Asp Ser Trp Asp Arg Phe Tyr Ile Lys Gly Gly
        225                 230                 235

CAA AAA TAC AAG TCC CCT AAA AAT GCC TAT GTT GAA GGT GAT GCC TCA 767
Gln Lys Tyr Lys Ser Pro Lys Asn Ala Tyr Val Glu Gly Asp Ala Ser
    240                 245                 250

AGC GCA AGC TAT TTC TTG GCT GGT GCT GCA ATT ACT GGA GGG ACT GTG 815
Ser Ala Ser Tyr Phe Leu Ala Gly Ala Ala Ile Thr Gly Gly Thr Val
255                 260                 265                 270

ACT GTG GAA GGT TGT GGC ACC ACC AGT TTG CAG GGT GAT GTG AAG TTT 863
Thr Val Glu Gly Cys Gly Thr Thr Ser Leu Gln Gly Asp Val Lys Phe
                275                 280                 285

GCT GAG GTA CTG GAG ATG ATG GGA GCG AAG GTT ACA TGG ACC GAG ACT 911
Ala Glu Val Leu Glu Met Met Gly Ala Lys Val Thr Trp Thr Glu Thr
            290                 295                 300

AGC GTA ACT GTT ACT GGC CCA CCG CGG GAG CCA TTT GGG AGG AAA CAC 959
Ser Val Thr Val Thr Gly Pro Pro Arg Glu Pro Phe Gly Arg Lys His
        305                 310                 315

CTC AAG GCG ATT GAT GTC AAC ATG AAC AAG ATG CCT GAT GTC GCC ATG 1007
Leu Lys Ala Ile Asp Val Asn Met Asn Lys Met Pro Asp Val Ala Met
    320                 325                 330

ACT CTT GCT GTG GTT GCC CTC TTT GCC GAT GGC CCG ACA GCC ATC AGA 1055
Thr Leu Ala Val Val Ala Leu Phe Ala Asp Gly Pro Thr Ala Ile Arg
335                 340                 345                 350

GAC GTG GCT TCC TGG AGA GTA AAG GAG ACC GAG AGG ATG GTT GCG ATC 1103
Asp Val Ala Ser Trp Arg Val Lys Glu Thr Glu Arg Met Val Ala Ile
                355                 360                 365

CGG ACG GAG CTA ACC AAG CTG GGA GCA TCT GTT GAG GAA GGG CCG GAC 1151
Arg Thr Glu Leu Thr Lys Leu Gly Ala Ser Val Glu Glu Gly Pro Asp
            370                 375                 380

TAC TGC ATC ATC ACG CCG CCG GAG AAG CTG AAC GTG ACG GCG ATC GAC 1199
Tyr Cys Ile Ile Thr Pro Pro Glu Lys Leu Asn Val Thr Ala Ile Asp
        385                 390                 395
ACG TAC GAC GAC CAC AGG ATG GCC ATG GCC TTC TCC CTT GCC GCC TGT 1247
Thr Tyr Asp Asp His Arg Met Ala Met Ala Phe Ser Leu Ala Ala Cys
    400                 405                 410

GCC GAG GTC CCC GTC ACC ATC CGG GAC CCT GGG TGC ACC CGG AAG ACC 1295
Ala Glu Val Pro Val Thr Ile Arg Asp Pro Gly Cys Thr Arg Lys Thr
415                 420                 425                 430

TTC CCC GAC TAC TTC GAT GTG CTG AGC ACT TTC GTC AAG AAT 1337
Phe Pro Asp Tyr Phe Asp Val Leu Ser Thr Phe Val Lys Asn
                435                 440

TAA 1340

(2) INFORMATION FOR SEQ ID NO:5:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 444 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:

Ala Gly Ala Glu Glu Ile Val Leu Gln Pro Ile Lys Glu Ile Ser Gly
1               5                   10                  15

Thr Val Lys Leu Pro Gly Ser Lys Ser Leu Ser Asn Arg Ile Leu Leu
            20                  25                  30

Leu Ala Ala Leu Ser Glu Gly Thr Thr Val Val Asp Asn Leu Leu Asn
        35                  40                  45

Ser Glu Asp Val His Tyr Met Leu Gly Ala Leu Arg Thr Leu Gly Leu
    50                  55                  60

Ser Val Glu Ala Asp Lys Ala Ala Lys Arg Ala Val Val Val Gly Cys
65                  70                  75                  80

Gly Gly Lys Phe Pro Val Glu Asp Ala Lys Glu Glu Val Gln Leu Phe
                85                  90                  95

Leu Gly Asn Ala Gly Ile Ala Met Arg Ser Leu Thr Ala Ala Val Thr
            100                 105                 110

Ala Ala Gly Gly Asn Ala Thr Tyr Val Leu Asp Gly Val Pro Arg Met
        115                 120                 125

Arg Glu Arg Pro Ile Gly Asp Leu Val Val Gly Leu Lys Gln Leu Gly
    130                 135                 140

Ala Asp Val Asp Cys Phe Leu Gly Thr Asp Cys Pro Pro Val Arg Val
145                 150                 155                 160

Asn Gly Ile Gly Gly Leu Pro Gly Gly Lys Val Lys Leu Ser Gly Ser
                165                 170                 175

Ile Ser Ser Gln Tyr Leu Ser Ala Leu Leu Met Ala Ala Pro Leu Ala
            180                 185                 190

Leu Gly Asp Val Glu Ile Glu Ile Ile Asp Lys Leu Ile Ser Ile Pro
        195                 200                 205

Tyr Val Glu Met Thr Leu Arg Leu Met Glu Arg Phe Gly Val Lys Ala
    210                 215                 220

Glu His Ser Asp Ser Trp Asp Arg Phe Tyr Ile Lys Gly Gly Gln Lys
225                 230                 235                 240

Tyr Lys Ser Pro Lys Asn Ala Tyr Val Glu Gly Asp Ala Ser Ser Ala
                245                 250                 255

Ser Tyr Phe Leu Ala Gly Ala Ala Ile Thr Gly Gly Thr Val Thr Val
            260                 265                 270

Glu Gly Cys Gly Thr Thr Ser Leu Gln Gly Asp Val Lys Phe Ala Glu
        275                 280                 285



Val Leu Glu Met Met Gly Ala Lys Val Thr Trp Thr Glu Thr Ser Val
    290                 295                 300

Thr Val Thr Gly Pro Pro Arg Glu Pro Phe Gly Arg Lys His Leu Lys
305                 310                 315                 320

Ala Ile Asp Val Asn Met Asn Lys Met Pro Asp Val Ala Met Thr Leu
                325                 330                 335

Ala Val Val Ala Leu Phe Ala Asp Gly Pro Thr Ala Ile Arg Asp Val
            340                 345                 350

Ala Ser Trp Arg Val Lys Glu Thr Glu Arg Met Val Ala Ile Arg Thr
        355                 360                 365

Glu Leu Thr Lys Leu Gly Ala Ser Val Glu Glu Gly Pro Asp Tyr Cys
    370                 375                 380

Ile Ile Thr Pro Pro Glu Lys Leu Asn Val Thr Ala Ile Asp Thr Tyr
385                 390                 395                 400

Asp Asp His Arg Met Ala Met Ala Phe Ser Leu Ala Ala Cys Ala Glu
                405                 410                 415

Val Pro Val Thr Ile Arg Asp Pro Gly Cys Thr Arg Lys Thr Phe Pro
            420                 425                 430

Asp Tyr Phe Asp Val Leu Ser Thr Phe Val Lys Asn
        435                 440
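
SEQ ID NO:3 and SEQ ID NO:5 have the same length, so the residues that differ between them can be located by position-wise comparison. The Python sketch below is an illustration only and not part of the original listing. It compares the row covering residues 97 to 112 of each protein, transcribed here from the listing, and reports the positions that differ. The starting position 97 is taken from the numbering under that row, and parse_residues and substitutions are hypothetical helper names.

```python
def parse_residues(row: str) -> list:
    """Split a row of three-letter residue codes into a list."""
    return row.split()


def substitutions(reference: list, variant: list, first_position: int = 1) -> list:
    """Return (position, reference_residue, variant_residue) for each difference."""
    if len(reference) != len(variant):
        raise ValueError("sequences must have the same length")
    return [
        (first_position + offset, ref, var)
        for offset, (ref, var) in enumerate(zip(reference, variant))
        if ref != var
    ]


# Residues 97..112 as transcribed from SEQ ID NO:3 and SEQ ID NO:5.
SEQ3_ROW = "Leu Gly Asn Ala Gly Thr Ala Met Arg Pro Leu Thr Ala Ala Val Thr"
SEQ5_ROW = "Leu Gly Asn Ala Gly Ile Ala Met Arg Ser Leu Thr Ala Ala Val Thr"

differences = substitutions(
    parse_residues(SEQ3_ROW), parse_residues(SEQ5_ROW), first_position=97
)
print(differences)  # [(102, 'Thr', 'Ile'), (106, 'Pro', 'Ser')]
```

The same function can be applied to the full 444-residue sequences with first_position=1 once they have been transcribed.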



